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High-dimensional entanglement is an important physical resource for quantum communication. 

A basic issue for any communication scheme is how many shared bits two two parties can extract 
subject to experimental noise. We determine the shared information that can be extracted from 
time-bin entangled photons using frame encoding. We consider photons generated by a general 
down-conversion source and also model losses, dark counts and the effects of multiple photons 
within each frame. Furthermore, we describe a procedure for including other imperfections such 
as after-pulsing, detector dead-times and jitter. The results are illustrated by deriving analytic 
expressions for the maximum information that can be extracted from high-dimensional time-bin 
entangled photons generated by a spontaneous parametric down conversion. A key finding is that 
under realistic conditions and using standard SPAD detectors one can still choose frame size so as to 
extract over 10 bits per photon. These results are thus useful for experiments on high-dimensional 
quantum-key distribution system, but are not limited to such systems. For example, the results are 
also useful in determining the limits of fibre arrays or within time-multiplexing schemes. 


I. INTRODUCTION 


It is well know that entangled photons can be used 
to extract shared random bits. The number of ex¬ 
tractable bits per photon pair depends on the di¬ 
mensions of the entangled degree of freedom. For 
example, polarization entanglement allows at most 
one shared bit per photon pair. An alternative is to 
use the arrival time of a photon. Encoding within 
the arrival time of a pair of photons offers an exper¬ 
imentally viable wayof generating high-dimensional 
entangled states [ll-lj. High-dimensional entangled 
states have many interesting properties and can 
allow for multiple shared bits extracted from each 
photon pair. This can be beneficial for quantum 
key distribution (QKD), where each detected pho¬ 
ton pair could encode over 10 bits of information 


There are several benefits to encoding within the 
time of arrival as opposed to other degrees of free¬ 
dom, such as the spatial modes. One key advan¬ 
tage is to minimize the effects of detector dead-time, 
which not only limits the rate at which information 
can be communicated, but also impacts on security 
within QKD [§, [l^ . Another benefit is that tempo¬ 
ral modes can be easily coupled into fibres, which is 
not the case for beams with non-zero orbital angular 
momentum dMl. 

It is clear that imperfections such as loss have a 
strong effect on how much information we can ex¬ 
tract from high-dimensional entangled photons. It 
is thus vital to model the effects of realistic experi¬ 
mental errors. Any model must take account of the 
photons source, channel losses and imperfect detec¬ 


tors. Nevertheless, it has been shown that it may 
still be possible to extract over 10 bits per photon 
pair under reasonable experimental conditions [l^ . 

In practice, the amount of extractable information 
depends on the error correcting scheme. In turn, this 
can depend on the physical implementation. The 
case of time-bin encoding raises specific problems. 
For example, in standard polarization based QKD 
schemes, one uses the timing information to help 
correct losses. It is thus possible to remove all the 
cases where Alice and Bob do not share coincident 
photons. This approach is clearly not suitable when 
information is encoded in the arrival time. Instead, 
we require a method for correcting errors that does 
not reveal the timing information. A common way 
of circumventing this is to split the arrival time into 
time-bins, which are then grouped together to form 
a frame [l5j | . Alice and Bob then publicly announce 
the number of photons detected in each frame. The 
use of frame encoding, while greatly facilitating error 
correction, does add an additional constraint to the 
extractable information. A realistic model must take 
this into account. 

The aim of this work is to determine the 
extractable information from high-dimensional, 
temporal-entangled photons. In particular, we 
determine the maximum number of shared bits 
that, on average, one obtains using frame encoding 
schemes. It is important to stress that while the 
main motivation for this work comes from QKD, we 
are not proposing a new QKD protocol. As such we 
do not concern ourselves with the task of securing 
the bits. Instead, we establish the maximum shared 
information that can be obtained via reconciliation. 
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The task of securing the bits will generally depend 
on the exact nature of the setup. 

The results we present are not only useful for 
QKD. For instance, it is has been argued that the 
mutual information can be used to quantify the en¬ 
tanglement within and SPDC source [l^. Further¬ 
more, the results can also be used to quantify the 
capacity of fibre array [l^, which can be used, for 
instance, in time-multiplexing of detectors [l^[20|. 

The general formalism we present can model ex¬ 
periments such as illuminating a nonlinear crystal 
with a mode-locked laser,(see Fig[T]and the descrip¬ 
tion in the next section). The approach is, however, 
not tied to this setup and applies to general sources 
of entangled photons. For instance, the formalism 
can be applied to cases where the Poissonian approx¬ 
imation is not appropriate. One could thus use our 
approach to model many different time-bin based 
experiments. In addition, the formalism also takes 
account of asymmetric channel losses, dark counts, 
jitter and other such effects. The breadth and gen¬ 
erality of the considered errors is be yon d that which 
is considered in previous works . 

To understand how these results can be useful, 
consider a QKD experiment with detector jitter. A 
common approach to reducing jitter is to increase 
the width of the time-bins. This is, however, not al¬ 
ways possible or practical. Furthermore, even when 
we can increase the time-bin width, this affects the 
amount of information one can extract. In this con¬ 
text, an important question is whether it is better 
to increase the width of the time-bins or to correct 
the jitter errors using a reconciliation protocol. To 
answer this question one must calculate the mutual 
information in the presence of jitter. The can be 
achieved using the results of this paper. 

Another way in which our approach goes beyond 
existing results, such as [H, is to calculate ex¬ 
plicitly the effect of frames that contain two or more 
photon pairs. Our findings are thus complementary 
to those of [ 2 ^, which presents a layered protocol 
for extracting information from general multi-array 
frames. The aim of the current work is to find 
the maximum possible extractable information us¬ 
ing any frame-encoding protocol. This should prove 
important for optimization and design of new error- 
correction codes for high-dimensional QKD. 


II. FRAME ENCODING 

Pairs of photons have been prepared experi¬ 
mentally where their arrival time is entangled [I[. 
A common way of generating such photons pairs 
is to use spontaneous parametric down-conversion 
(SPDC) [i,[il[i3- Figure [T] shows a typical setup, 
where a nonlinear crystal is pumped by a mode- 


locked laser [2l|, . The incoming pulses are classi¬ 

cally coherent. As down-conversion is a unitary pro¬ 
cess, the coherence between the pulses is transferred 
to a coherence between the amplitudes to generate 
photon pairs in each time-bin. In the ideal case, 
two parties, called Alice and Bob, use this setup 
to generate a random sequence photon pairs that 
are perfectly correlated in time. Alice and Bob then 
use single-photon counter modules and synchronized 
time-tagging devices to obtain the timing informa¬ 
tion. Setups such as this have been realized experi¬ 
mentally (2ll - l^ . 

To make use of such time-entangled states, the 
arrival time is divided into a discrete set of time- 
bins. For the case of a mode-locked train of pulses, 
the time-bin width is set by the pulse spacing. In 
alternate setups where photons are generated by a 
single pulse or a continuous-wave laser, the time is 
discretized by dividing the time into discrete time- 
bins. If the widths of these time-bins are chosen ap¬ 
propriately, then Alice and Bob should detect their 
photons within the same time-bin. The uncertainty 
in the arrival time can then be used to extract shared 
random bits. An eavesdropper could then be de¬ 
tected by measuring within another basis [26l - [3^ . 


Down conversion 



FIG. 1: A schematic for a experimental setup that gen¬ 
erates and distributes high-dimensional, time-of-arrival 
entangled photons. A mode-locked laser generates a co¬ 
herent train of pulses. The pulses pumps a nonlinear 
crystal that produces entangled photon pairs in some 
of the time-slots; many of the slots contain no photons 
(color online). 

In real experiments, there will always be errors. 
Alice and Bob will thus carry out error-correction to 
obtain a shared random string. A simple approach 
is to group to geth er several contiguous time-bins to 
form a frame [l^. For each frame, Alice and Bob 
announce the number of time-bins in which they de¬ 
tect photons. Let Ka and Kb denote the number of 
time-bins in which photons were detected by Alice 
and Bob respectively.^ One can use Ka and Kb io 


^ Note that Ka and Kb do not correspond to the number 
of photons in Alice and Bob’s frames. For example, Ka = 
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classify the frames; we thus write {Ka, -?fB)-frames, 
to denote the class of frames where Alice see Ka 
clicks while Bob sees Kb- Error-correcting codes 
can then be developed to deal with each class of 
frame. In many setups, the chance that Alice and 
Bob will detect multiple photons within a frame is 
low. In such situations, it is sufficient to consider 
only cases such as (1,1) and (2, 2)-frames. 

We envisage more complicated frame-encoding 
schemes, where Alice and Bob don’t publicly an¬ 
nounce Ka and Kb- In all that follows, we con¬ 
sider only simple schemes where Ka and Kb are 
announced. The reason is that an understanding 
of this situation is vital also for the more compli¬ 
cated protocols. In particular, we will show that the 
shared information for the case where Ka and Kb 
are announced differ from the more complicated pro¬ 
tocols by a single term. Thus, the results we present 
can also be used to calculate the shared information 
for more advanced protocols. 

Let N represent the number of time-bins that 
comprise each frame. If Alice observes clicks in Ka 
time-bins, then the number of possible distinct mea¬ 
surement records she could have is given by the Bi¬ 
nomial coefficient 


A particular record of measurement results, or mea¬ 
surement patterns, can be denoted by an A^-bit bi¬ 
nary string that indicate the location of the time- 
bins where photons are detected. For example, if 
N = A, Ka = 2, and Alice sees clicks in the first 
and third time-bins, then the corresponding binary 
string is 1010. It will prove useful to introduce a 
further piece of notation. We denote Alice’s mea¬ 
surement patterns symbolically as , where r is 
the binary string that uniquely describes the pat¬ 
tern. We describe Bob’s measurement pattern using 
the same notation, where each A is changed to B. 

If there are no errors, then Alice and Bob each 
see the same measurement pattern. When losses 
are present, then Alice and Bob’s measurement pat¬ 
terns can be different. Nevertheless, there will still 
be some correlation in their results. We expect that 
it should still be common to observe clicks within the 
same time-bins. For a particular frame, let L be the 
number time-bins in which they both share clicks. 
For example, if Alice has the pattern 0101, while 
Bob sees the pattern 1100, then L = 1. Clearly, 
Mm{KA, Kb} > L- Furthermore, the allowed val¬ 
ues for L satisfy the inequality N > Ka Kb — L- 


m 


KaI{N-Ka)\' 


( 1 ) 


1 means that Alice detected a click within one time-bin. 
In principle, it is possible that this click corresponded to 
multiple photons or even a dark count. 


For fixed values of Ka-, Kb and L, the total num- 
ber of different joint patterns for Alice and Bob is 

nL{KA,KB) = N\I[L\{Ka-L)\{Kb-L)\{N-Ka- 
Kb -\- L)\] which is a multinomial coefficient. These 
observations will prove useful later on. 

We want to determine the information contained 
within an average frame. The mutual information 
per frame is denoted as Hj-rameiA : B). To find 
the number of shared bits per photon, we divide 
Hframe{A '- B) by the average number of pho¬ 
ton pairs found within a frame. We then calculate 
Hframe{A '- B) using the method outlined in fl^ . 
For a frame encoding scheme, we will not reach the 
bits per photon limit set by Hframe{A : B). The rea¬ 
son for this is that, in a frame encoding scheme, Alice 
and Bob publicly announce the number of clicks they 
see in each frame. They then apply error correction 
codes individually to each class of frame. This nec¬ 
essarily results in a loss of randomness, and hence, 
of random bits. 

The shared information that we can extract is re¬ 
lated to the mutual information. However, it is not 
Hframe{A '- B), but instead the mutual information 
post-selected on when Ka and Kb have specific val¬ 
ues. This implies that we must use conditional prob¬ 
abilities in place of the standard probabilities to de¬ 
termine the conditional mutual information 

Suppose Alice observes x clicks while Bob sees y, 
the maximum shared information per frame is given 
by the conditional mutual information 


HiA:B\KA=x,KB=y) = 
-Er,sPiA^r,By\KA=X,KB=y) 


X log 2 


P{A^,By\KA=x,KB=v) 

P{A^\KA=x)P{BS\KB=y) 


( 2 ) 


where P{Af,By\KA = x,Kb = y) is the joint con¬ 
ditional probability for Alice and Bob to obtain the 
patterns A^ and Hf, while P{A^\Ka = x) and 
P{By\KB = y) are the marginal conditional proba¬ 
bilities for Alice and Bob, respectively. The condi¬ 
tional mutual information H(A : B\Ka = x,Kb = 
y) gives the maximum number of bits per frame that 
can be extracted from (x, y)-frames. 

One average, Alice and Bob can extract 
K(A : B\Ka,Kb) = = x,Kb = 

y)H{A ■- B\Ka = x,Kb = y) bits per frame, 
where P{Ka = x,Kb = y) is the probability 
for Alice and Bob to observe x and y clicks, 
respectively. Notice that H{A : B\Ka, Kb) ^ 
Hframe{A '- B), hence we have lost some infor¬ 
mation. We find that P[frame{A : B) = H{A : 
B\Ka,Kb) + H{Ka,Kb), where H{Ka,Kb) = 
-Y.x,vP(P^a = x,Kb = y) \og2P{KA = x,Kb = y), 
i-e-, it is the entropy in the uncertainty in the num¬ 
ber of clicks per frame. The loss of information thus 
follows simply from the fact that Alice and Bob 
announce the values of Ka and Kb- 
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In a practical application, one may not be able to 
develop effective error correcting codes for all of the 
classes of frame. In this instance, H{A : B\KatKb) 
will over-estimate the extractable information. The 
actual extractable information can be found by av¬ 
eraging H{A : B\Ka = x,Kb = y) over the frames 
for which we do have error correcting codes. For ex¬ 
ample, if we only have codes for (l,l)-frames, then 
the extractable information is P{Ka = 1,Kb = 
l)H{A-.B\KA = hKB = l). 


III. CALCULATING THE CONDITIONAL 
PROBABILITIES AND CONDITIONAL 
MUTUAL INFORMATION 

In this section, we present a general procedure for 
calculating the conditional probabilities and hence 
the conditional mutual information. The approach 
allows us to calculate the probabilities for a gen¬ 
eral source and includes the effects of errors such as 
channel losses and detector imperfections. The first 
step is to work out the detection probabilities for a 
single time-bin. We then use these to construct the 
probabilities to observe specific measurement pat¬ 
terns, from which the conditional probabilities are 
calculated. 

One thing to notice is that the coherence between 
the time-bins does not appear in our calculations. 
The reason for this is that our present results are for 
the case when we measure within the time-of-arrival 
basis. Such measurements cannot detect coherence 
between time-bins. Furthermore, they will actually 
destroy any coherence. The situation we thus con¬ 
sider is one where a measurement has been made 
of the photons time-of-arrival, which inevitably dis¬ 
turbs the temporal coherence. The temporal coher¬ 
ence would, however, be reveled if one measured in 
a basis that was a superposition of time-bins. For 
instance, consider a mode-locked laser generating a 
train of coherence pulses acting as a pump for a non¬ 
linear crystal. The coherence between the time-bins 
of the down-converted photons is not evident if we 
measure the time-of-arrival. However, it has been 
demonstrated experimentally using a Franson inter¬ 
ferometer [35j |. 


A. Single time-bin probabilities 

To calculate the single time-bin detection proba¬ 
bilities, we must first model the source, channel and 
detectors. The approach we use is based on that 
presented in [l^. We thus only give a brief recap 
of the important points. First, we assume that the 
source produces pairs of entangled photons, where 
the probability to produce m pairs within any given 


time-bin is Ps{m). For simplicity, we initially as¬ 
sume that Ps{'m) is the same for each time-bin. Let 
A be the average number of photon pairs produced 
per time-bin, hence A = mPs{m). 

The information is encoded in the temporal lo¬ 
cation of the photon pairs, not their number. For 
this reason, we assume that the detectors do not re¬ 
solve photon number. For ideal detectors, with no 
losses, the probability to observe a click in a time-bin 
is All real detectors, however, suffer 

losses. Let Alice and Bob’s detectors loss be ^.4 and 
^B respectively. The probability to detect a single 
photon that is incident on Alice’s detector, is thus 
^A- In addition to the losses due to inefficiencies in 
the detector, there are also losses from transmission 
of the photons from the source to the detectors. Let 
rja and ijb be the losses in Alice and Bob’s chan¬ 
nels, respectively. Combining the two sources of loss 
into a single total efficiency, Alice’s total efficiency is 
VA = ^AVa, and Bob’s total efficiency is tjb = ^AVb- 

We are now in a position to calculate the proba¬ 
bility for Alice and Bob to observe photons within 
a single time-bin. The key mathematical method is 
to use moment generating function to include the ef¬ 
fects of loss. See [1^ for a full discussion on moment 
generating functions and their properties. 

For a source described by the probability distri¬ 
bution Ps{m) and total losses for Alice and Bob of 
rjA and 773 , respectively, we define the moment gen¬ 
erating function 

00 

^ P,{m){l-iqAv)'^{l-TlBO'^, (3) 

m—0 

where we have neglected dark counts for now. Con¬ 
sider a single time-bin. The probability for Alice and 
Bob to observe a click within a given time-bin is de¬ 
noted by ttA®, where i,j G { 0 , c} and c represents a 
click while 0 signifies no click. It can be shown that 
the probabilities are [l^ 

( 4 ) 

00 

n—1 

The effect of dark counts is taken account of using 
the following procedure. Let Pij represent Alice and 
Bob’s probability to detect photons within a single 
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time-bin when dark counts are present. We find that 

Poo = (1 - <?)^7r^o® (5) 

Poc = (1 - q)T^of + (1 - q)q^oo^ 

Pco = (1 - q)'^c^ + (1 - q)qT^oo 
Pec = + q4e^ + + q^^o"", 

where q is the probability to observe a dark count 
in a single time-bin. We see that the above proba¬ 
bilities sum to one. The marginal probabilities 
and are found from the joint probability Pij. A 
key feature of these general expression for Pij is that 
they are valid for any choice for the source probabil¬ 
ity Ps{m). Thus, our results are not be limited to 
any particular physical implementation. 


B. Probabilities for each frame 


The probability for Alice (or Bob) to observe a 
particular measurement pattern is calculated using 
the relevant single time-bin probabilities P^ or P^. 
The probability for Alice to see a pattern is 
P{A^^) = ■ The total probabil¬ 

ity for Alice to observe a measurement pattern with 
photons detected in x time-bins is 

P{KA = x)=(^iyP^Y[P^f-\ ( 6 ) 


The probabilities for Bob have the same form, but 
instead use the probabilities P^. 

The joint detection probabilities P{A^^, is 

calculated using the single time-bin joint detection 
probabilities. We find that 




(7) 


The total probability for Alice and Bob to detect 
photons within x and y time-bins, respectively, is 
thus 


P{Ka =x,KB=y) = Y, ^l{Ka, Kb) 

L 

X [Pec]"' [Pco]"-^ [Poc]"-"' [Poo]'^-"-"+"', (8) 

where ftL{KA, Kb) denotes the multinomial coeffi¬ 
cient A^!/[L! (AT^ - P)! (ATb - P)! (A^ - A :.4 - -b P)!] ■ 
Equation ([5|) leads to the same marginal probabili¬ 
ties as in Eq. ®. 

The conditional probabilities are calculated from 
Eqs. ®, d?]) and ®, by recalling the definition of a 
conditional probability: P(X|y) = P{X,Y)/P{Y). 


We find that 


P{A^\Ka = x) 


P(Pf j/Ps = y) 

P{A-,By\KA = x,KB=y) 



pL px — L py — L pN — x — y-\-L 
^cc^cO ^Oc ^00 _ 

P{Ka=x,Kb=v) 


where P{Ka = x,Kb = y) is given in Eq. ([ 8 |). The 
expressions for P(A“|/P^ = x) and P{By\KB = y) 
are not true when tja = Vb = 1- The reason for this 
is that, in this limit, Alice and Bob must observe 
the same patterns and Ka = Kb ■ This implies that 
P{Ka = x) and P{Kb = y) will be zero ii x y. In 
the case of rjA = rjB = 1, then Ka = Kb = K and 
the conditional mutual information has the simple 
form 


H[A : PjiP = x) = log 2 



( 10 ) 


In the remainder of this section, we focus on the case 
where both t/a and rjB 


C. Information per photon pair 

We calculate the various entropic quantities, by 
using Eqs. m through to ®. Eor instance, we 
hnd that Hframei^ • P) = NH{A : B), where 
H(A : B) is the mutual information per time-bin. 
Suppose we only have error correcting codes for 
(l,l)-frames, i.e., frames where Alice and Bob both 
announce that they each observe a single click. The 
maximum extractable information is Hi i{A : B) = 
P{Ka, Kb)H{A : B\Ka = 1,Kb = 1). 

A typical application of this theory is in high¬ 
dimensional QKD, which aims to encode multiple 
bits on each photon pair. It is thus worth consid¬ 
ering the bits per photon pair. The average num¬ 
ber of photon pairs generated within each frame is 
NX. Due to losses, the average number of photon 
pairs that one detects per frame is N(riAVBX q^). 
The average number of bits per generated photon 
pair is iJiq(A : B)/{NX), while the average num¬ 
ber of bits per detected photon pair is i/ip(A : 
B)/{N[riA'r]BX -b g^]). These quantities can be cal¬ 
culated using the conditional probabilities given in 
Eq dOj), within Eq dlj). We find that, by using only 
(1, l)-frames, the average number of shared bits per 
detected photon is 

Ha{A : B\Ka = I, Kb = 1) =-W 

VaVbX + q^\ T 

pPeePoO log(Pcc7oo) -b (A^ - l)PcoPoc \og{PcoPoc)'j , 
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where T = {N — l)PcoPoc + PccPoo and Pij are given 
in Eq. ©■ The expression for the average number 
of bits per generated photon has the same form as 
CD, but with rjArjB^ + q^ replaced in the denomina¬ 
tor by A. We stress that Eq. CD includes the effects 
of losses, dark counts and a general source. Further¬ 
more, it can also be applied to situations where the 
dark count rates are different on each side. This 
is accomplished by modifying only the probabilities 
Pij , not the form of Eq. CD- 

Suppose we have error-correcting codes that work 
for (2,2)-frames. The extractable information when 
using only these frames is found using Eqs. to¬ 
gether with (I2|) and ([5]), giving the maximum num¬ 
ber of bits per detected photon pair as 


Hd{A : B\Ka = 2,Kb= 2 ) 


’nA'TiB^ + q^ ^ 


X log 


N{N -l) 


+ {PccPoo)'^ logiPcc-Poo] 


+ 2(^130^00)^ log[f’c 0 ^bc] 

-|-(A^ — 2)PccPcoPocPoO log[T*cc^cO-Poc^bo] I, 

( 12 ) 


where — ^(fbo^cc)^ + ~ 2)PccPcoPocPoo + 

j{N - 2){N - 3){PcoPoc)^- As with Eq. ([TT]), we 
obtain the information per generated photon pair 
by replacing riAriB^ + in the denominator with A. 

If we have error correcting codes for both (1,1) and 
(2,2)-frames, then the total amount of extractable 
information is the sum of the information for the 
two cases, i.e., Hd{A : B\Ka = 1, Kb = 1) + Hii{A : 
B\Ka = 2, Kb = 2). In general, the information 
we can extract using only (x, 2 /)-frames is calcu¬ 
lated using the conditional probabilities ([5]) and ([H]). 
As X and y become large, the resulting expressions 
become more complex. Nevertheless, we can still 
obtain analytic expressions by following the same 
straightforward procedure. 

As Alice and Bob publicly announce Ka and Kb , 
they are losing all the information contained within 
the correlation of these quantities. It is possible to 
develop approaches that retain some of this informa¬ 
tion. As showed in Sec. m the correlations in Ka 
and Kb give a total contribution of H{Ka, Kb) bits 
per frame to the total shared information per frame. 
In terms of bits per detected photon pair, we can 
gain an additional iJ(Aryi, Kb)/{ yAyB^Pq^) bits per 
photon. In practice, protocols generally will access 
a certain fraction / of these bits, where 0 < / < 1. 


IV. ADDITIONAL ERRORS: 
AFTER-PULSING AND DETECTOR 
DEAD-TIME 


Losses and dark counts are not the only errors 
that affect the extractable information. There are 
also effects such as detector jitter, after-pulsing and 
detector dead-times. The discussion of jitter is post¬ 
poned until the next section. In this section, we 
explain how the formalism is modified to describe 
after-pulsing and dead-time. 

After-pulsing occurs when the detection of a pho¬ 
ton sets up a feedback process that can lead to the 
detector registering a click at a later time [13 ■ After¬ 
pulsing will thus temporarily increases our chance to 
see a dark count after we register a click. One ap¬ 
proximate model of after-pulsing is to increase the 
dark count probability q for some fixed number of 
time-bins /3 after a detection. One important fea¬ 
ture of after-pulsing is that it occurs regardless of 
what triggered the detector. This means that after¬ 
pulsing occurs also for dark counts. The single time- 
bin detection probabilities, Eqs. include con¬ 

tributions from dark counts. This means that our 
approach will take account of after-pulsing that is 
generated both by photons and from dark counts. 

The value for /3 can be large [13 ■ This means 
that a click near the beginning of a frame can result 
from after-pulsing from the previous frame. Sim¬ 
ilarly, the average position of detected photons is 
random, which means the location of the (3 time- 
bins will also be random. Recall, however, that we 
are calculating the shared information for an aver¬ 
age frame. To take account of these difficulties, the 
fairest approach is to modify q for all time-slots. In 
this case, the information per photon will retain the 
form given in in Eqs. CD and CD. but where the 
value of q has been suitably increased. 

After a photon is detected, it is common for a 
detector to loose sensitivity to subsequent photons 
for a period of time. This interval of time is know 
as the detector’s dead-time [13- If the duration of 
the dead-time is equal to the width of Md time-bins, 
then we will not observe photons for at least the 
next Md time-slots after a detection. Dead-time is 
not a serious problem for (l,l)-frames, provided that 
the frame is longer than the period of dead-time. In 
this limit, Eq. CD is still valid. However, the effects 
of detector dead-time will be important for classes 
of frames such as (2,2)-frames. For these cases, we 
must adopt the following modified procedure. 

First, we calculate the moment generating func¬ 
tion and the single time-bin probabilities. We then 
calculate the probabilities to observe each pattern, 
however, now we must set P{A^^), P{Bf’^) and 
P{A^^ equal to zero if r or s contain I’s in 

time-slots that are closer together than Md- We then 
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calculate the new probabilities P{Ka, Kb), P{Ka) 
and P{Kb), together with the new conditional prob¬ 
abilities. Finally, the conditional probabilities are 
used in Eq. m to calculate the conditional mutual 
information. The approach is best illustrated by an 
example. 

Suppose we have a detector with dead-time of the 
order of one time-bin width, i.e., = 1. This 

means that it is impossible for Alice to observe two 
photon measurement patterns such as 1100 or 0110. 
The probability to observe such patterns must be set 
to zero, hence P{^iqq) = 0. This reduces the num¬ 
ber of two photon patterns that Alice can observe 
from N{N — l)/2 to {N — 1){N — 2)/2. In general, 
dead-time reduces the total number of allowed two- 
photon patterns to {N — Md)[N — Md — l)/2. 

It is convenient to introduce a function 
that is 0 if the pattern contains I’s that are 
closer together than Md- Otherwise, the function 
returns the value of 1. For example Ai(lOlO) = 1, 
while Ai(llOO) = 0. The new probabilities to ob¬ 
serve measurement patterns can be expressed in 
terms of P{A^'^, B^^), P{A^^) and P{B^^), the 
probabilities for the case when there is no dead-time 
effect. As an example, consider the new probabili¬ 
ties, denoted by a tilde, for the case when Md = 1. 
We find that 


P{A^\Bf-) = 
P{A^^ 
P{Bf-) = 


p2 5 


P(AKa \ - P(A^^)I\^(A^^) 
\-^r ) {PqY ’ 


(13) 


The reason for dividing by either (Fbo)^ or 
is to take into account the fact that, after our two 
clicks, we cannot detect anything. This is distinct 
from not observing a click, which happens with joint 
probability Pqo and marginal probabilities P^^ and 
P(f. We note that Eq. (IT^ neglects the effect of 
obtaining a click within the last time-bin, e.g., ob¬ 
serving a pattern such as 01001. In cases like this, 
we should only divide P{A^^) by Pq^^\ This is 
because we don’t see the dead-time for the last de¬ 
tection. When the frame size is large, the relative 
probability to observe a click within the last time-bin 
becomes small. In this regime, our approximation is 
very good. 

The modification of the above results to the case 
where Md > 1 is straightforward. If we neglect the 
effects of the frame edge, then P{A^^,B^^) = 
P(Af-, Bf-) Am, (Af-) Am, {Bf- )/[Poo?^‘^ ■ 

The marginal probabilities are P{X^) = 

P{X)lf)X{X)^)/[P^]'^’^'^, where X is either A 
or B. When Md becomes larger, relative to the 
frame size, the approximation may seem dubious. 
One can explicitly take account of the edges by 


changing the probabilities such as P(AfOOl). How¬ 
ever, as Md becomes large, this also increases the 
probability that a detector cannot register photons 
in the beginning of a frame due to dead-time from 
a click in the previous frame. The effect of these 
two edge effects is to act in opposite ways. One 
increases the pattern probabilities, while the other 
acts to decrease them. The net effect is that, to 
some extent, both effects compensate for each other. 
It is thus still a good approximation to neglect both 
edges. 


V. DETECTOR JITTER 

One thing we have omitted so far is the temporal 
response of the detectors. In any real detector, there 
can be a randomly fluctuating delay between a pho¬ 
ton being incident on the detector and it firing. This 
is very important in time-binned experiments, as it 
can cause a photon to be registered in the wrong 
time-bin. This effect is known as detector jitter [s^. 
In this section we show how jitter can be included 
within our model. For the sake of clarity, we illus¬ 
trate the approach only for (1, l)-frames. The gen¬ 
eral method, however, can also be applied to other 
frame classes. 

A simple way of modeling jitter is to calculate a 
discrete set of ‘jump’ probabilities from the tempo¬ 
ral response. Mathematically, the temporal response 
is the probability distribution to register a photon 
at a time t after it was incident on the detector.^ 
By integrating over the width of each time-bin, we 
convert the continuous probability distribution into 
a discrete set of detection probabilities. Suppose a 
photon is generated within the r-th time-bin. Let 
Jn be the probability that we observe a click within 
the {r + n)-th time slot. The probability to observe 
the photon within the correct time-bin is thus Jq. 
Clearly, = 1. Often, J„ is non-zero only for 

n = 1 or 2. 

One difficulty in modeling jitter is the presence of 
dark counts. The single time-bin detection proba¬ 
bilities include dark counts, which are not subject 
to jitter. However, the probability q is the same 
for each time-bin.^ The dark count probability is 
thus invariant to shifts in time. This suggests that 


^ Often the detector’s response is well described by a Gaus¬ 
sian. The approach we will outline, however, makes no 
assumption about the form of the continuous probability 
distribution. 

® This will not be true in the presence of after-pulsing. Nev¬ 
ertheless, this does not affect our results as we adopted an 
overly cautions approach where after-pulsing is modeled by 
increasing q for every time-bin. This means that q is the 
same for every time-bin in our mathematical formalism. 






contribution from dark counts within Pij should 
also be approximately invariant to temporal shifts. 
Hence, to an excellent approximation, it is not ef¬ 
fected by jitter.'^ This observation means we can use 
the jump probabilities directly with the probabilities 
Pij , without having to separate out the contribution 
from dark counts. 

We begin by looking at the marginal probability 
for Alice to observe a particular measurement pat¬ 
tern. It is convenient to modify our notation. As we 
are interested in the case where Ka = ATb = 1 , we 
represent Alice’s pattern as Ai, where i is the loca¬ 
tion of the time-bin where the photons are detected. 
For example, Ai represents the pattern where Alice 
observes a click within the first time-bin. 

As a further simplification, we limit our analy¬ 
sis to the case when the detector’s response is short 
enough so that only Jq and Ji are greater than zero 
and Jq + Ji = 1 while J 2 = 0. Appendix A ex¬ 
plains how to generalize the results to the case when 
J 2 7 ^ 0. Consider two time bins that are away from 
the edges of the frame. In these time-bins we observe 
a single click in the second of them, i.e., our mea¬ 
surement pattern is 01. If there were no jitter, then 
this pattern occurs with probability PqP^. When 
the detector jitter is not negligible, then the pattern 
could have arisen from two possible situations. First, 
there was no delay and we observe the photons in the 
correct time-slot, which occurs with probability Jq. 
Alternatively, jitter could have caused a photon that 
was in the first time-bin to be registered within the 
second. We find that the total probability to observe 
the pattern 01 is 


Vi = JoP^P^ + JiP/. (14) 

If a photon is incident on the last time-bin within 
a frame, then jitter can cause it to be lost to the 
frame. Similarly, a click within the first time-slot 
of each frame could have come from the previous 
frame. Let Ve be the probability to not see a click 
within the last time-bin of a frame. We find that 

Pe = + JlPt (15) 

We thus see that the probabilities for Alice to ob¬ 
serve a given pattern is 

P(Ai) = VlVe{P^)^-\ 

P{A,) = VlVe{P^)^-\ 

P{An) = Vi{P^)^-\ (16) 


^ This approximation is very good for q small. To be more 
quantitative, if the temporal width of each time-bin is Ins, 
then the approximation is excellent provided the dark count 
rate is less than about 10® counts per second. 


where 1 < i < N. The results for Bob will have the 
same form, but with each A changed to a B. 

The joint probabilities, P{Ai, Bj), are more in¬ 
volved. To simplify our exposition, we assume sym¬ 
metric channel losses, i.e., t]a = rjB- The case where 
Va 7 ^ Vb is described in Appendix B. Each pat¬ 
tern can be broken up into a small set of events, 
from which each pattern can be constructed. For 
example, one event is described by the probability 
for both Alice and Bob to observe clicks within the 
same time-bin. The probabilities of these events can 
be expressed in terms of Jq, Ji and the single time- 
bin probabilities Pij. The probabilities P{Ai,Bj) 
are expressed in terms of the event probabilities. 

Let 7^11 be the probability for Alice and Bob to 
both observe clicks within the same time-bin. The 
fact that the detectors suffer from jitter means that 
we must consider two time-bins to calculate Pu. It 
is found that 


7^11 — JqPoqPcc 

+2JoJi(Poc)" +JiBcc, (17) 

where we use the fact that P^^ = P^^^ when rjA = 
rjB. The probability that both Alice and Bob do not 
see a click in the last time-bin of their frame is 

T’oo = Too + JiPcc + 2JiPoc, (18) 

where we have again use the fact that P^^^ = P^^. 

It is possible that Alice and Bob can see clicks 
in adjacent time-bins. For example, Alice’s detector 
could fire within the n-th time-bin, while Bob’s fires 
within the {n + l)-th. Let T^i* be the probability for 
Alice’s detector to fire in a time-bin directly before 
Bob’s. Similarly, let 7^*i be the probability for Bob 
to observe a detection in the n-th time-bin while Al¬ 
ice sees one in the (n -|- l)-th time-slot. To calculate 
7^*1 and Pi„, we need to consider three time-bins. 
We find that 


7^*1 — 7^1* — JoPooiPco)^ 


JJiPcPoPco + JqJiPq 


PoqPcc + (Poc)^ , (19) 


where Pq = P^^ = P^ and Pc = P^ = P^ ■ The 
final situation that we consider is when Alice and 
Bob obtain clicks in different time-bins, which are 
not adjacent. The fact that J 2 = 0 implies that we 
can be certain that any photons detected by Alice 
and Bob were not from the same photon pair. Let 
PiQ be the probability for Alice to observe a click 
in a time bin when Bob does not see a click in the 
same or adjacent time bins. Similarly, T^qi is the 
probability for Bob to observe a click while Alice 
does not see one in nearby time slots. We find that 


7^10 — 7^01 — JoPoqPco + JiPcqPq 
+ JqJiPqqPcc + JiPcqPc- 


( 20 ) 
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These event probabilities can be used to construct 

One complication with calculating P{Ai,Bj) is 
that we be must careful of detection events near the 
edges of the frame. For instance, the probability 
P{Ai,Bi) is different from P{A'i, B^). This differ¬ 
ence is due to the fact that a detection in the first 
time-bin could have come from the previous frame. 
When the size of the frame becomes large, the rel¬ 
ative effects of the edges becomes small. One could 
thus neglect the effects of the edges®. In this case, 
the probability for Alice and Bob to observe partic¬ 
ular measurement patterns is given by 

P{A,,B,) = \^-J\>P 

( 21 ) 

In Appendix C, we give the full form of P{Ai,Bj), 
where edge effects are not neglected. Using the prob¬ 
abilities (ED), we find that 


PiKA = 1 ,Kb = 1) = noiPoi^'f-" NPniPoT) 


jAB\2 


+2{N - l)PioPoi^ + {N-1){N- 2)Vl 


( 22 ) 


The post-selected information per detected photon 
pair is thus 


Ha{A : B\Ka = 1,Kb = 1)= 

X P{Ka = 1,Kb = 1) 

X{2 log2 N - log2 P{Ka = 1, Kb = 1)} 
+NP{Ai,B,)log^PiA,B,) 

+2{N - 1)P(A„ 5,+i) log2 P{A, B,+i) 

+ (iV _ i)(iv - 2)P(A„ B,) log2 P(A„ B,)] .(23) 


The assumption that we neglect edge effects means 
that Eqs. ED, ED and (1231) are all valid only for 
N > 6. In Appendix C, we compare the approxi¬ 
mate results given above with the more complicated 
exact results. It is shown that even for A^ = 8, the 
difference between the exact and approximate ex¬ 
pressions can be very small (less than 0.1%). Thus 
we can safely use the approximate expression given 
in Eq. dMl). 


® We are not neglecting all of the effects of the edges. We 
still including the probability that one could loss a photon 
from the edges of a frame. 


VI. RESULTS FOR A MODE-LOCKED 
LASER PUMPING A SPDC SOURCE 

The previous results will now be illustrated by 
looking at a specific experimental setup. The sit¬ 
uation we consider is a mode-locked laser that pro¬ 
duces a train of coherent pulses that pump a nonlin¬ 
ear crystal. The pulses are generated such that each 
pulse is coherent to one another [2ll - [^ . We fix the 
parameters of the crystal and laser such that we ob¬ 
serve SPDC that produces a pair of photons that are 
correlated in time. The down-converted photon pair 
is split with one half kept by Alice, while the other 
is sent to Bob. The experimental configuration is 
shown in Fig. [TJ 

The spacing of the pulses define natural time-bins 
for Alice and Bob. Alice and Bob thus choose the 
widths of their time-bins so that they contain a sin¬ 
gle pulse. To a good approximation, the probability 
that Alice and Bob observe m photon pairs in each 
time-bin is given by a Poissonian distribution 

\m 

Ps{m) = e~^ — , (24) 

ml 

where A is the average number of photon pairs gen¬ 
erated in each time-bin. For this source, the mo¬ 
ment generating function, defined in Eq. ED, is just 
= e-xp{\[-r]AV - rjB^ + tjaVeI'^])- Using 
this within Eqs. (21) and ([5]) yields joint detection 
probabilities for each time-bin 

Poo = (1 (25) 

Po^ = (I - q)e-^^^ - (1 - g)2e-^[’7A-fr,B-,?A-7B]^ 

P^o = (1 - q)e-^^^ - (1 - q)2e-^hA-f^B-^A^B]^ 

P,c = l-(l-g) 

_(_ (1 _ q'^'^gi-MVA+VB-riAVs] ^ 

The marginal probabilities for Alice (Bob) are 
Pg^(^) = (I - q)e-^^MB) and P^^^^ = 1 - Pq^^\ 

Suppose that the main sources of errors are losses 
and dark counts. One can use Eq. (1^ directly 
within ED to determine how many shared bits per 
detected photon we can extract using only ( 1 , 1 )- 
frames. The extra information contained in (2, 2)- 
frames can be calculated using Eq (1121) . These re¬ 
sults can be used to optimize the frame size N. Fur¬ 
thermore, one can also investigate how the exper¬ 
imental parameters affect the number of bits per 
photon. This could be important, for instance, in 
evaluating the advantages of improving the detec¬ 
tor’s efficiency. 

To illustrate our results we look at typical parame¬ 
ters for two detectors: a single-photon avalanche de¬ 
tector (SPAD) and a superconducting nanonwire de¬ 
tector. We assume that we have time-bins of width 
130 ps. The SPAD has efficiency r] = 0.7, dark count 
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g (a) SPAD 



FIG. 2: Shared bits per detected photon pair as a func¬ 
tion of the frame size N. All plots are for A = 5.33 x 10“®. 
In both (a) and (b), the dashed red line is for (1,1)- 
frames, the dotted blue line is for (2, 2)-frames and the 
solid black lines if for both the (1,1) and (2, 2)-frames. 
Fig. (a) is for a single photon avalanche detector with 
T] — 0.7 and q — 6.53 x 10“®. Fig. (b) is for a nanowire 
detector with rj = 0.9 and q = 1.3 x 10~^° (color online). 


rate of 500/s and an after-pulsing rate of 0.5%. The 
effective dark count probability, which includes the 
effects of after-pulsing, isq = 6.53 x 10“®. For the su¬ 
perconducting nanowire detectors ij = 0.9, the dark 
count rate is 1 /s and the after-pulsing rate effec¬ 
tively zero. We calculate the dark count probability 
as g = 1.3 x 10~^°. Figures 2 and 3 shows the infor¬ 
mation within ( 1 , 1 ) and ( 2 , 2 )-frames as a function 
of the frame size N, for a SPAD and superconduct¬ 
ing nanowire detector, for two different values of A. 

We see in Fig. 2 that, for N = 1000, (2, 2)-frames 
can contain a significant fraction of the total shared 
bits. This is not always true, however, as shown in 
Fig. 3. Another important point to note from Fig. 
3 is that, for N = 3000, we can extract over 11 bits 
per photon pair using either of the two detectors. 
For both the detectors we considered, the efficiency 
is high. If the detectors have low efficiencies, then we 
would obtain less information; it would thus become 
crucial to optimize the frame size. For example, con¬ 
sider a detector with g = 6.53 x 10“® and 77 = 0.3. 
We find that, for a source with A = 5.33xl0“®, we 
obtain 10.3 bits from the (l,l)-frames by choosing 
N = 3579. 

The design of error correcting codes for (2,2)- 


(a) SPAD 



FIG. 3: Shared bits per detected photon pair as a func¬ 
tion of the frame size N. All plots are for A = 1.0 x 10~®. 
In both (a) and (b) the dashed red line is for (1,1)- 
frames, the dotted blue line is for (2, 2)-frames and the 
solid black lines if for both the (1,1) and (2, 2)-frames. 
Fig. (a) is for a single photon avalanche detector with 
Tj — 0.7 and q — 6.53 x 10“®. Fig. (b) is for a nanowire 
detector with rj = 0.9 and q = 1.3 x 10“^° (color online). 


frames can be difficult. If we find that, for given val¬ 
ues of loss and the dark count rate, Hd{A : B\Ka = 
2 , Kb = 2 ) is negligible, then we know that it would 
not be worth using these frames.® This result also 
provides a good guide to determine the regime that 
one must work in so that ( 2 , 2 )-frames contribute 
signihcantly. Similarly, one can use the results of 
Sec. Ill to calculate the information within (2,1)- 
frames. One could thus investigate the gains from 
developing error correcting codes for these and other 
situations. 

The previous results did not include the effects of 
detector dead-times. However, if we are to fully eval¬ 
uate the information contained within ( 2 , 2 )-frames, 
then we must take this effect into account. Results 
for this can be found using the approach detailed 
in Sec. IIVI For the SPAD, the dead-time is 30 ns, 


This conclusion still holds if jitter is significant. This is 
because additional errors cannot increase the shared infor¬ 
mation. 
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FIG. 4: Shared bits per detected photon pair as a func¬ 
tion of the frame size N. Both plots are for (2,2)- 
frames with A = 5.33 x 10“®. The black curve is for 
a superconducting nanowire detector with rj = 0.9 and 
q — 1.3 X while the dashed blue curve is for a 

SPAD with rj = 0.7 and q = 6.53 x 10“®. The dead¬ 
time for the superconducting nanowire corresponded to 
154 time-bins, while the dead-time for the SPAD corre¬ 
sponded to 230 time-bins (color online). 


which corresponds to approximately 230 time-bins, 
while for the superconducting nanowire, the dead¬ 
time is 20 ns, which corresponds to 154 time-bins. 
Figure |4] compares the the shared information in 
(2,2)-frames for the case of dead-time and no dead¬ 
time, where part (a) is for the SPAD and (b) is for 
the superconducting nanowire. Both curves are for 
A = 5.33 X 10“®. While dead-time can reduce the 
information, we still see that useful information can 
still be extracted from (2,2)-frames. 

In many realistic situations, detector jitter is non- 
negligible. We can include the effects of jitter by 
using the formalism described in Sec. 5. There is, 
however, a subtlety when one applies the theoretical 
results to an experiment. It is common to calculate 
the heralded efficiency directly from experimental 
data. Detector jitter decreases the probability to ob¬ 
serve photons within a particular period of time. If 
one is not careful, then we could over estimate losses 
and hence under estimate rj. To illustrate this, con¬ 
sider the example where we have a source that can 
produce a single photon within a specific time-bin. 
One can use this source to estimate rj by looking at 
the probability w to detect the photon. If our detec¬ 
tors suffer from jitter, then w ^ rj. Instead, we have 
w = rjjq when we neglect dark counts. 

In some situations, under estimating the efficiency 
can be a good thing. For example, a reduction in r] 
will decrease our estimate of the number of bits we 
can extract. We could use this as a crude way of tak¬ 
ing account of jitter. Such an approach would, how¬ 
ever, be too pessimistic if we have already included 
jitter explicitly within our model. In the rest of this 


c 

o 



FIG. 5: Plot of shared bits per detected photon pair as 
a function of the frame size N. All plots are for (1,1)- 
frames with A = 2.0 x 10“®. The solid black line corre¬ 
sponds to the SPAD with rj = 0.7, q = 6.53 x 10“® and 
Jo = 0.9. The dashed blue line is for the superconducting 
nanowire with rj = 0.9, q = 1.3 x 10~^° and Jo = 0.97. 
Finally, the dotted and dashed red line corresponds to 
the threshold for 10 bits per photon pair (color online). 

section, we will assume that the total efficiency rj has 
been estimated such that it is completely associated 
with losses. 

Jitter causes a decrease in the correlation within 
Alice and Bob’s timing information. This inevitably 
leads to a decrease in the number of shared bits. To 
evaluate the effects we calculate Hd{A : B\Ka = 
1,A'b = 1) for the SPAD and superconducting 
nanowire detector, with a A = 2.0 x 10“®. We 
take Jo = 0.9 for the SPAD and Jq = 0.97 for 
the nanowire detector. Figure [5] shows Hd{A : 
B\Ka = IjATb = 1) plotted as a function of N. 
The solid black curve corresponds to the SPAD, 
while the dashed blue line is for the superconduct¬ 
ing nanowire. We see that for an appropriate choice 
of N, we can still obtain greater than 10 bits per 
photon using either detector. One can get a bet¬ 
ter feel for how jitter affects use by looking at how 
H[A : B\Ka = IjATb = 1) changes with Jq. Con¬ 
sider a setup with A = 2.0 x 10“®, rj = 0.7 and 
q = 6.53 X 10“®, i.e. the parameters for the SPAD. 
We could extract 11.1 bits per photon for N = 4000, 
if we had no jitter (Jq = 1). If instead, Jq = 0.9, 
then we could extract 10.2 bits per photon for frames 
of size N = 4000. 


VII. CONCLUSIONS 

The time-of-arrival degree of freedom provides an 
experimentally viable means of implementing high¬ 
dimensional quantum information protocols, and is 
particular well suited for quantum communication. 
One important example of this is high-dimensional 
QKD. Such schemes can, however, be hampered by 
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the difficult in performing error correction. A per¬ 
tinent example of this is in time-bin based QKD, 
where unlike in polarization based QKD, one can¬ 
not use each photon’s arrival time to help correct 
errors. Instead, it is common to split the arrival 
time into discrete time-bins, which are grouped to¬ 
gether to form frames. A key question is how this 
affects the amount of shared information that Alice 
and Bob can extract. We answer this question and 
obtain general results for the maximum number of 
extractable shared bits for photons entangled within 
their time-of-arrival, when using frame encoding. 

Our results go beyond existing work in a number 
of areas. First, we present results for frames that 
contain photon pairs within multiple time-bins. We 
can thus investigate how many bits are lost by ne¬ 
glecting such events and when such events should 
be kept. The results can thus be used to improve 
the efficiency of extraction of shared bits from noisy 
experimental setups. 

Another way in which the current work improves 
on existing works is in the range of errors consid¬ 
ered. We study systems that suffer from asymmetric 
losses, dark counts, after-pulsing, dead time and jit¬ 
ter. The formalism also works for a general choice of 
source. The results for the case of jitter are of par¬ 
ticular interest. We have found analytic expressions 
for the extractable information when we have jitter 
in addition to losses and dark counts. This could 
be important for optimization of high-dimensional 
QKD. For example, in some experiments, we have 
freedom in the choice of the time bin width. Often, 
one chooses the width such that it minimizes jitter. 
By using our results, we can more efficiently choose 
the time-bin width so as to optimize the shared in¬ 
formation. 

The results are illustrated by considering at entan¬ 
gled photons generated by a nonlinear crystal that 
is pumped by a mode-locked laser. This source pro¬ 
duces a train of pulses that are temporally coherent 
to one another. Two different types of detector were 
considered, a SPAD and a superconducting nanowire 
detector. The results show that under appropriate 
conditions, we can chose a frame size so as to extract 
over 10 shared bits per photon pair. 

One issue we have not considered is how one 
might actually extract the shared information, j.e., 
reconciliation. There has been some work in this 
area [H, [H, [2^. Some of the present authors 
have developed a reconciliation protocol that is tai¬ 
lored to the case of a mode-locked laser pumping 
a down-conversion source [40| . This protocol can 
treat multi-photon events and can recover some of 
the frame-to-frame information contained within the 
photon number uncertainty. 
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Appendix A: Modeling jitter for a long-tailed 
detector response 

The calculations in Sec. 5 assumed that J„ = 0 
for n > 2. This is consistent with the detector’s 
temporal response (z.e., its probability distribution) 
being effectively zero over more than two time-bins. 
While this assumption is often true, there are detec¬ 
tors for which it would not hold. In this Appendix, 
we briefly outline how to generalize the previous re¬ 
sults. The aim is not to present extensive results, 
but instead to show how to adapt the previous re¬ 
sults. The general approach is illustrated by inves¬ 
tigating at the case when J 2 ^ 0 , but J 3 , J 4 ,... = 0 , 
hence Jq + Ji + J 2 = 1 . 

Recall, Jo is the conditional probability to register 
a photon in the correct time-bin, i.e., the time-bin in 
which the photon actually was incident on the detec¬ 
tor. The conditional probability to register a click 
n time-bins after it was incident on the detector, is 
given by J„. We first consider Alice’s (or equiva¬ 
lently Bob’s) marginal probabilities. In the absence 
of jitter, the probability for Alice to see a click in a 
given time bin is P^. When we do have jitter, then 
a detected photon could have originated in previous 
time slots. The probability to observe a click thus 
changes. To calculate the new probability Vi, we 
consider three time-bins, as the term J 2 can cause 
a photon to jump over two time-bins. We find that 
the probability for Alice to see a click in a given time 
bin is 

[J 2 + JiP^ + MP^f ]. (Ai) 
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where = 1 — P^. Equation (EB) is composed 
of three separate terms. The first term in Eq. (EB 
corresponds to a photon that has ‘jumped’ two time- 
bins due to jitter. The second term is for a photon 
that is detected within a time-bin directly after the 
correct one. Finally, the third terms corresponds to 
the detector firing within the time slot in which it 
was incident on the detector. 

To calculate the probability T^e, that we don’t see 
a click at the last time-bin of a frame we must con¬ 
sider two time-bins. We find that the probability 
is 

V, = J 2 [P^] + Ji [P^P^] + [Po^] ^ (A2) 

Again, we have three terms corresponding to three 
possible ways in which the event could be realized. 
The probability to observe a given measurement pat¬ 
tern is again constructed from Pi, Pg and P^. One 
thing we must take care of are photons detected in 
time slots near the edge of each frame. It is possi¬ 
ble that these correspond to photons from previous 
frames, which are registered in a later frame due to 
jitter. These edge effects mean that P{A{) or P{A 2 ) 
will not equal P{Ai), where z is a time bin in the mid¬ 
dle of the frame. Similarly, P{Aiy) ^ P{Ai), where 
again i corresponds to a time bin near the middle of 
the frame. We find that 


response is longer and thus jitter can cause a photon 
to be register two time slots after it was incident on 
the detector. 


Appendix B: Jitter with asymmetric losses 

The results for jitter given in Sec. |V] assumed 
that r]A = riB, to simplify the expressions. In this 
Appendix, we briefly show how the results are modi¬ 
fied for asymmetric loss. The marginal probabilities 
for Alice and Bob contain terms that depend only 
on TjA or on rjB- Thus, there is no need to modify 
these results. The joint probabilities P{Ai, Bj) will, 
however, need to be modified. 

The first step is to calculate the probabilities for 
the individual events, e.g. Pn, Pqq, etc. The key 
issue is that now Pco 7 ^ Pqc, which was implicitly 
assumed within the derivations. As a first step, con¬ 
sider the probability that Alice and Bob both see a 
click within the same time-bin. We find that 

Pll = JqPooPcc + 2Jo Jl(Pocp3o) + JfPcc- (Bl) 

The new probability that both Alice and Bob don’t 
detect photons in the last time-bin of their frame is 


P(Ai) = PlPe(Po^)'^-^ 
P{A2) = VlVe{P^)^-\ 
P{A,) = VlVe{P^)^-\ 
P(Aw-l) = VlVe{Po)^-\ 
PiA^) = Pl(Po^)'^-^ 


Poo ~ Poo + JiPcc + Jl\Poc + Pco\- (B 2 ) 

The probability for Alice and Bob to obtain clicks 
in adjacent time-bins is given by Pi* and P*i. Pre¬ 
viously, we found that Pi* = P*i, which is not true 
in general. We find that 


where i < 2 < A — I. The probability for Alice 
to post-select on a Ka = 1 frame is P{Ka = I) = 
P{Aj). The results for Bob will have the same 
form. 

The joint probabilities P{Am, Bn) can be calcu¬ 
lated in the same fashion by first recalculating Pn, 
PEo, Pio and Pi*. However, now we require an extra 
term Pi**, which corresponds to Alice seeing a pho¬ 
ton two time-bins before Bob does. Each of these 
probabilities will again be calculated by looking at 
several time-bins. For instance, to calculate Pn, we 
must consider three time-bins for Alice and Bob. As 
an example, the new form for Pn is 

Pll = JqPooPcc + JiPooPcc + J^Pcc 
+ ‘2'JoJlPooPcoPcc + 2J0J2PC0P0P1 
+ 2 J 1 J 2 PC 0 P 1 , (A4) 


Pi* 

= 

JoPooPcoPoc + J^P^P^PcO 


-b 

Jo Jl 

Po PqoPcc + JcP JbcPcO 

P*1 

= 

JoPooPcoPoc + JlP^P^Poc 


-b 

Jo Jl 

JcP PooPcc + P^ PqcPcO 


The final event probability is for the case when Alice 
and Bob obtain clicks in different and non-adjacent 
time-bins. Again we will find that Pio 7 ^ Poii given 
explicitly by 

Pio = JqPooPcO + JiPcoPq 

+ JoJiPooPcc P JiPcoPc ■ 

Poi = JoPooPoc + JlPocPf^ 

+ JoJlPooPcc + JiPocP^- (B4) 


where we have used the fact that Pco = Poc when 
VA = Vb- Notice that (IA4I) contains more terms than 
Eq. (ITTI) . which was derived for J 2 = 0. These extra 
terms result from the fact that now the detector’s 


The event probabilities will, again, be used to con¬ 
struct the joint frame probabilities P{Ai, Bj). 

As before, we simplify our results by assuming 
that we can neglect edge effects. Using this assump- 
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tion, we find that 


p{A,b,) 

Bi+l) 


p{A+,,b,) 

P{A,B,) 


results. In all of what follows, we assume that only 
Jq and Ji are not equal to zero and that r]A = r]B- 

PioVoiVUPm)''~\ (B5) 


where |* — i| > 1- The conditional probabilities and 
all the relevant entropic quantities can now be cal¬ 
culated as before. 


Appendix C: Comparison of the exact and the 
approximate results for jitter 

In this Appendix, we compare the approximate 
results for jitter to the longer, but more accurate 


In general, the frame edges influence the probabil¬ 
ities for each pattern, e.g. P{Ai,Bj) ^ P{A^,Bj). 
This is because we analysis each frame separately. 
We thus loose information about what happens in 
the time-bins directly before the beginning of each 
frame. The probabilities shown in Eq. m are de¬ 
rived by neglecting the edges. When we include the 
edges, we find that the probabilities become 


P{A,,B,) = 

P{A,,Bi) = = 

P{Am,Bm) = 

PiA,,B2) = P{A2,B,)=V^oPUPoi''f~^ 

P{A,,B,) = P{A,,B,) = 1<J<N, 

P{A,,Bn) = P{AN,B,) = {Vi.f{P^o''f~\ 

= P{A,+uB,)=VioVllo{Pm'')^~\^<i<N-l 
P{A,,Bn) = P{AN,Bi) = {V^,f{P^^^)^~\l<i<N, 

= P{An,Bn-i) = 

P{A,B,) = 


(Cl) 


There will be A^ — 2, P{Ai,Bi) terms for 1 < z < iV. 
Similarly, there are iV — 3, P{Ai,Bi), P{Ai,Bi), 
P{Aj,Bi\f) and P{Aj^ ,Bj) terms, where 2 <i < N 
and 1 < j < iV—1. One can also verify that there are 
A^ — 1 terms such as P{Aj, Bj+i) and P{Aj+i, Bj), 
where j = 1, N — 1. Finally, the number of re¬ 
maining terms can be found by recalling that the 
joint probability contains a total of N'^ different out¬ 
comes. 

Equation (ICll) is significantly more complicated 
than m- Using these probabilities, we calculate 
Hd{A : B\Ka = = 1) and compare this 

with the approximate result given in Figure 

[5] (a) shows a direct comparison for rjA = tIb = 0.3, 


A = 5.33 X 10-4, g = 3.9 X 10“® and Ji = 0.4 as a 
function of N. The solid black curve is exact expres¬ 
sion, while the dashed red curve is the approximate 
expression. The percentage difference between the 
exact and approximate results is shown in Fig. [ 6 ](b). 
We see that the agreement between the two results 
is excellent for large N. Somewhat surprisingly, the 
approximation is accurate to less that 1 % for frames 
as small as N = 8. The match between the exact 
results and the approximate ones holds also for dif¬ 
ferent values for rjA, A and q. For example, for 
TjA = r]B = 0.7, A = 5.33 x 10"®, q = 6.53 x 10“® 
and Ji = 0 . 1 , then we find a percentage difference 
of less than 0.001% for N = 10. 
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FIG. 6: Comparison of the exact expression for Hd{A : 
B\Ka = ^,Kb = 1) with the approximate expression 
Fig. (a) shows plots of the post-selected infor¬ 
mation Hd{A : B\Ka ~ ^,Kb = 1), as a function of 
the frame size N. The black curve is the exact result, 
while the dashed red curve is the approximate result. 
Fig. (b) shows a percentage difference between the ex¬ 
act and approximate results, as a function of the frame 
size. All curves are for rjA = Vb ~ 0.3, A = 5.33 x 10“"*, 
q — 3.9 X 10“® and Ji = 0.4 (color online). 
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